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(57) Abstract 

The invention relates to compounds of general structure (I) or salts thereof, wherein B is a nucleobase. X and Z independently are 
oxygen or sulphur. Y is hydrogen or hydroxy, which optionally may be protected. Ri is hydrocarbyl. which optionally is substituted with 
a functional group. R 2 is hydrogen or hydrocarbyl. which optionally is substituted with a functional group. A is an electron wuhdrawing 
or electron donating group capable of moderating the acetal stability of compound 0). Li and Li are hydrocarbon linkers which may be 
the same or different. L 2 . when present, being either (i) connected to L, via the group A. or (ii) directly connected to L, the group A then 
being connected to one of linkers Li and L 2 . F is a dye label. Q is a coupling group for F. and 1. m and n independently are 0 or 1. with 
the proviso that 1 is 1 when m is 1. and 1 is 1 and m is 1 when n is 1. The compounds of formula (I) are useful as deactivatable chain 
extension terminators. The inventi n also relates to the use of the c mpounds 0) in nucleic acid synthesis and nucleic acid sequencing as 
well as to a method of preparing compounds of Formula (I). 
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NOVEL CHAIN TERMINATORS, THE USE THEREOF FOR NUCLEIC ACID 
SEQUENCING AND SYNTHESIS AND A METHOD OF THEIR PREPARATION 

The present invention relates to novel nucleic acid 
5 chain extension terminators, their use in nucleic acid 
sequencing and synthesis, respectively, as well as a 
method for preparing such compounds . 

Today, there are two predominant methods for DNA 
sequence determination: the chemical degradation method 
10 (Maxam and Gilbert, Proc. Natl. Acad. Sci. 71:560-564 

(1977), and the dideoxy chain termination method (Sanger 
et al., Proc. Natl. Acad. Sci. 71:5463-5467 (1977)). Most 
automated sequencers are based on the chain termination 
method utilizing fluorescent detection of product 
15 formation. In these systems either primers to which 

deoxynucleotides and dideoxynucleotides are added are dye- 
labelled, or the added dideoxynucleotides are 
f luorescently labelled. As an alternative, dye labelled 
deoxynucleotides can be used in conjunction with unlabeled 
20 dideoxynucleotides. This chain termination method is based 
upon the ability of an enzyme to add specific nucleotides 
onto the 3 ' hydroxyl end of a primer annealed to a 
template. The base pairing property of nucleic acids 
determines the specificity of nucleotide addition. The 
25 extension products are then separated electrophoretically 
on a polyacrylamide gel and detected by an optical system 
utilizing laser excitation. 

Although both the chemical degradation method and the 
dideoxy chain termination method are in widespread use, 
30 there are many associated disadvantages. For example, the 
methods require gel-electrophoretic separation. Typically, 
only 400-800 base pairs can be sequenced from a single 
clone. As a result, the systems are both time- and labor- 
intensive. Methods avoiding gel separation have been 
3 5 developed in attempts" to increase the sequencing 
throughput . 

Sequencing by hybridization (SBH) methods have been 
proposed by Crkvenjakov (Drmanac et al . , Genomics 1:114 
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(1989); Strezoska et al . , (Proc. Natl. Acad. Sci. USA 
j££.:10089 (1991)), Bains and Smith (Bains and Smith, J. 
Theoretical Biol. 12JL:303 (1988)) and in US-A-5 , 202 , 231 . 
This type of system utilizes the information obtained from 
5 multiple hybridizations of the polynucleotide of interest, 
using short oligonucleotides to determine the nucleic acid 
sequence. These methods potentially can increase the 
sequence throughput beacuse multiple hybridization 
reactions are performed simultaneously. To reconstruct the 
10 sequence, however, an extensive computer search algorithm 
is required to determine the most likely order of all 
fragments obtained from the multiple hybridizations. 



15 sequence composition of the duplex of the oligonucleotide 
and the polynucleotide of interest, so that GC-rich 
regions are more stable than AT-rich regions. As a result, 
false positives and false negatives during hybridization 
detection are frequently present and complicate sequence 

20 determination. Furthermore, the sequence of the 

polynucleotide is not determined directly, but is inferred 
from the sequence of the known probe, which increases the 
possibility for error. 



25 addition or removal of single molecules from a DNA strand. 
For example, Hyman E.D., Anal. Biochem. , 174.: 423 
(1988) discloses the addition of a nucleotide to a an 
immobilised DNA template/primer complex in the presence of 
a polymerase and determination of polymerisation reaction 

30 by detecting the pyrophosphate liberated as a result of 
the polymerisation. 

Jett et al . , J- Biomol. Struct. Dyn., I, p. 301, 1989 
discloses a method wherein a single stranded DNA or RNA 
molecule of labelled nucleotides, complementary to the 

35 sequence to be determined, is suspended in a moving flow 
stream. Individual bases are then cleaved sequentially 
from the end of the suspended sequence and determined by a 
detector passed by the flow stream. 



The SBH methods are problematic in several respects. 
For example, the hybridization is dependent upon the 



Methods have also been proposed which detect the 
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EP-A-223 618 discloses the use of an immobilised DNA 
template, primer and polymerase exposed to a flow 
containing only one species of deoxynucleotide at a time. 
A downstream detection system then determines whether 

5 deoxynucleotide is incorporated into the copy or not by 

detecting the difference in deoxynucleotide concentrations 
entering and leaving the flow cell containing the complex 
of DNA template and polymerase. 

WO 90/13666 proposes a method directly measuring the 
10 growth of the template copy rather than determining it 

indirectly from compositions in the flow medium. Only one 
of the four nucleotides is present at a time, and the 
polymerisation events reflecting the incorporation of a 
nucleotide or not are detected by spectroscopic means 

15 (evanescent wave spectroscopy, fluorescence detection. 

absorption spectroscopy) or by the individual nucleotides 

being labelled. 

Similar methods employing labelled 3 ■ -blocked 
deoxynucleotides where the blocking group is removable and 
20 which thus permit sequential deoxynucleotide 

addition/detection steps are disclosed in WO 91/06678, US- 
A-5,302,509, DE-A-414 1178 and WO 93/21340. However, the 
necessary 3 • -blocking groups are either not described in 
any detail, or are not accepted by the required enzyme, or 
25 do not permit desired rapid deblocking of the growing 
template copy strand after each polymerisation event. 

One object of the present invention is to provide 
novel nucleotide derivatives which may be used as chain 
terminators and which by deprotection may readily be 
30 converted into nucleotides or nucleotide analogues that 
may be further extended. 

Another object of the present invention is to provide 
a method for nucleotide sequence determination using the 
novel chain terminators. 
3 5 Still another object of the present invention is to 

provide a method of synthesizing oligo- or polynucleotides 
by means of the novel chain terminators. 
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Another object of the present invention is to provide 
a process of preparing novel chain terminators according 
to the invention. 

In accordance with the invention, these objects are 
5 achieved by the provision of a chain terminating 

nucleotide or nucleotide analogue having its 3 1 -hydroxy 1 
group protected by an acetal or thioacetal structure 
designed in such a way that the 3 • -hydroxy 1 can be 
deprotected in a relatively short time in dilute acid, 
10 such as hydrochloric acid at pH 2, for example. 

In one aspect, the present invention therefore 
provides a compound of the general formula I: 



15 



o o x 

HO-P-O-P-O-P-O 
OH OH OH 



B 



v-7 



20 



R.-Z-C-L^A) - <L 2 ) 1- (Q)m- ( F >n 
R 2 



25 or a salt thereof, such as a trimethylammonium, ammonium, 
sodium or potassium salt, wherein 
B is a nucleobase, 

X and Z independently are oxygen or sulphur, 

Y is hydrogen, hydroxy or protected hydroxy, such as 
30 methoxy, ethoxy or allyloxy, 

Rl is hydrocarbyl, which optionally is substituted 
with a functional group, 

R2 is hydrogen or hydrocarbyl, which optionally is 
substituted with a functional group, 
35 A is an electron withdrawing or electron donating 

group capable of moderating the acetal stability of the 
compound I via Li, 
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Li and L2 are hydrocarbon linkers, which may be the 
same or different, L2, when present, being either (i) 
connected to Li via the group A, or (ii) directly 
connected to L^, the group A then being bound to one of 
5 linkers Li and L2/ 

F is a dye label, 

Q is a coupling group for F, and 

1, m and n independently are O or 1, with the proviso 
that 1 is 1 when m is 1, and 1 is 1 and m is 1 when n is 
10 1. 

In Formula I above, the nucleobase B may be natural 
or synthetic. Natural nucleobases include common 
nucleobases, such as adenine, guanine, cytosine, thymine 
and uracil, as well as less common nucleobases, such as 

15 xanthine, hypoxanthine or 2-aminopurine . Synthetic 

nucleobases B are analogues to the natural nucleobases and 
capable of interacting with other nucleobases in a 
specific, hydrogen bond determined way. 

The hydrocarbyl groups represented by Ri and R2 

20 include a wide variety, including straight and branched 
chain alkyl, alkenyl, aryl, aralkyl and cycloalkyl, 
preferably containing up to 10 carbon atoms. Preferred 
hydrocarbyl groups are primary, secondary or tertiary 
alkyl, alkenyl or alkynyl groups, especially lower alkyl 

25 groups, such as methyl and ethyl. The optional functional 
group substituents on and R2 are capable of moderating 
the lability of the 3 ' acetal group through an inductive 
effect. Exemplary of such functional groups are tert . 
amino, nitro, cyano and halogen (fluoride, chloride, 

30 bromide, iodide) . 

Detectable moiety, or label F can be chosen from a 
vast number of such moieties known to those skilled in the 
art. Exemplary such moities are radioactively labeled 
functions, luminescent, electroluminescent or fluorescent 

35 labels, and labels that absorb characteristic visible or 
infrared light. Preferably, F is a fluorescent label. 

Coupling group Q, when n=0, is a reactive group to 
which a label F can be coupled, or, when n=l, is the 
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derivatized residue of a reactive group used for coupling 
linker L*2 to label F. A large number of such coupling 



reactive function thereon are known to those skilled in 
5 the art. Examples of reactive groups are amino, thio, and 
carboxyl. In some cases, the group Q is derived completely 
from the reactive function on the uncoupled F. For 
example, if the coupling reaction is a substitution 
reaction, the group reacting with F being, for example, a 

10 halogen (fluoride, chloride, bromide or iodide), then the 
group Q will be represented by the reactive function 
present on label F. 

For certain applications of the compounds of Formula 
I, as will be described below, no detectable moiety will 

15 be needed, and the label F, and optionally also the group 
Q, can then be omitted. 

Electron withdrawing or donating group A is 
incorporated into the structure as a moderator of acetal 
stability. The group A may represent a part of the chain 

20 L1-A-L2. Representative groups A in that case are amido, 

sulfoxy, sulfone, carbalkoxy (ester) , ether, thioether and 
amino groups. Alternatively, A may be a side substituent 
for the L1-L2 chain, representative groups then being, for 
example, cyano, nitro and halogen (halogen including 

25 fluoride, chloride, bromide and iodide) . In this latter 

case, linker Li will donate structure stretching from the 
acetal carbon to the place of the substitution, and linker 
L2 will donate structure stretching from this substitution 
to the group A. It is to be emphasized that the specific 

30 electron withdrawing and electron donating groups 

mentioned above are only examples and that many more such 
groups are known and obvious to those skilled in the art. 

The structure of hydrocarbon linker Li will be 
selected with regard to the function A, inductive effects 

35 being highly depending on distance. While a straight 

aliphatic (saturated or unsaturated) hydrocarbon chain is 
preferred, branched or cyclic hydrocarbons may be 



groups capable of reacting with and binding label F via a 
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contemplated. Preferably, linker Li has up to 10 carbon 
atoms, more preferably up to 6 carbon atoms. 

The function of hydrocarbon linker L»2 is to provide, 
together with linker Li, A and coupling group Q, for a 
5 sufficiently long distance between the label moiety F and 
the rest of the compound of Formula I. This is required by 
spatial preferences of the enzymes (polymerases) for which 
the compound of Formula I acts as substrate or by the 
necessity of avoiding interaction between the label and 

10 nucleobase B. It is readily understood that the structure 
of linker L2 highly depends on the particular enzyme, 
label F and possibly also nucleobase B that are used. A 
suitable length and structure of linker L2 will therefore 
be selected by the skilled person for each particular 

15 situation. Similarly as for linker Li, a preferred 

structure for linker L2 is a straight aliphatic (saturated 
or unsaturated) hydrocarbon chain, although branched or 
cyclic hydrocarbons may be contemplated. Preferably, 
linker L2 has up to 10 carbon atoms, more preferably up to 

20 6 carbon atoms. 

In cases where the label F and the coupling group Q 
may be omitted, linker L2 may, of course, also be omitted. 

The compounds of Formula I can be deprotected to 
exhibit a free 3 1 -hydroxy group in a relatively short time 

25 under acidic conditions, e.g. with hydrochloric acid at 
about pH 2 . It is understood that by proper selection of 
the groups , R2 , A, Li and L2 with regard to each other, 
the deprotection time may be adjusted to a desired range. 
Under the mentioned acidic conditions, the most preferred 

30 compounds of Formula I will be deprotected within, say, 
0.01 to 15 minutes. 

While the compounds of Formula I may be used as pure 
chain extension inhibitors, or chain terminators, for 
example, in DNA sequencing according to the chain 

3 5 termination method, as is per se known in the art, the 
advantages of the compounds are, of course, better 
benefited from when the convenient deprotection 
capabilities of the compounds are utilized. This is, for 
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example, the case when the compounds I are used in nucleic 
acid sequencing methods based on the sequential 
incorporation and determination of individual nucleotides 
in a growing nucleic acid copy strand as described in, for 
5 example, the aforementioned WO 91/06678, US-A-5 , 302 , 509 , 
DE-A-414 1178 and WO 93/21340. 

Another aspect of the invention therefore provides a 
method for determining the sequence of a nucleic acid, 
which method comprises providing a single- stranded 

10 template comprising the nucleic acid to be determined, and 
at least partially synthesizing a complementary nucleic 
acid molecule in a stepwise serial manner by the addition 
of nucleotides in which the identity of each nucleotide 
incorporated into the complementary nucleic acid molecule 

15 is determined subsequent to its incorporation, wherein 
said nucleotides are compounds of Formula I as defined 
above, and wherein the 3 ' -blocking group is removed from 
the nucleotide after its incorporation to permit further 
extension of the nucleic acid molecule. 

20 In one embodiment, a method for determining the 

sequence of a nucleic acid comprises the following steps: 

(i) providing a single- stranded template comprising 
the nucleic acid to be sequenced, 

(ii) hybridising a primer to the template to form a 
25 template/primer complex, 

(iii) subjecting the primer to an extension reaction by 
the addition of compounds of Formula I with different 
nucleobases B corresponding the four bases A, C, T and G 

or analogues thereof, 
3 0 (iv) determining the type of the compound of Formula I 

added to the primer, 

(v) selectively hydrolysing the acetal protective 
group , and 

(vi) repeating steps (iii) to (v) sequentially and 
3 5 recording the order of incorporation of compounds of 

Formula I. 

The different compounds of Formula I in step (iii) 
may be added in sequence, in which case the four different 
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compounds I may carry the same label F. Alternatively, the 
different compounds I have different labels F and are 
added at the same time. 

In a preferred embodiment of the method of the 
5 invention, the template /primer complex is bound to a 
solid-phase support, such as a sequencing chip, for 
example. The template may be attached to the solid support 
via a binding linker, which, for instance, is ligated to 
the 5 ' -end of the template or incorporated in one of the 
10 ends of the template by polymerase chain reaction (PCR) . 
The binding linker may then be attached to the solid 
support by use of a streptavidin coupling system. 
Alternatively, the primer may be attached to the solid 
support . 

15 The compounds of Formula I may, of course, also 

conveniently be used in so-called mini -sequencing (see 
e.g. Syvanen A-C et al., Genomics £:684-692 (1990). 

As is readily understood by a person skilled in the 
art, the compounds of Formula I may also be used in the 

20 synthesis of nucleotide chains, and another aspect of the 
invention relates to such use. For example, 
oligonucleotides and polynucleotides may be prepared by 
successively coupling compounds of Formula I to each other 
in any desired base order with intervening deblocking and 

25 using a non- template dependent polymerase, such as a 

terminal transferase. For such synthesis, the groups L 2 , Q 
and F in Formula I may, of course, be omitted. 

The compounds of Formula I may be prepared by methods 
known per se. In a further aspect, however, the invention 

30 provides a particular method for the preparation of a 
subgroup of compounds of Formula I by direct 3 * -OH 
protection, and more particularly compounds of Formula la: 
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10 

or a salt thereof, wherein B, X, Z, R^, R2, Q, F, m and n 
are as defined above, and p and q independently are 
integers from 1 to 10, preferably from 1 to 6, by reacting 
a compound of Formula II: 

15 



20 



HO-P-O-P-O-P— O 
OH OH OH 



OH 



II 



or a salt thereof, wherein B and X are as defined above, 
25 with a compound of Formula III: 



30 



R 2 -C=CH- (CE2)p-l-COOR 3 1X j 



wherein R^, R2, Z and p are as defined above, and R3 is 
hydrocarbyl (for example, as defined for R^ and R2 above) 
to produce a compound of Formula IV: 



O O 



X 



11 11 ^ R 
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or a salt thereof, wherein B # X, Z, R^, R2, R3 and p are 
as defined above, optionally reacting the latter with a 
diamine H2N- {CH2 ) q-NH2 # wherein q is as defined above, to 
produce a compound of Formula V: 



o o x 

HO-P-O-P-O-P— O — ■ o D 



OH OH OH 



10 R 2^°\ 

(CH2) p 

X. N ^<CH 2 ) q -NH 2 

0 H 

wherein B, X, Z, R^, R2 , P and q are as defined above, and 
15 optionally coupling a dye label F to the terminal amino 
group . 

In the following, the invention will be illustrated 
by some non-limiting examples. Reference will be made to 
the accompanying drawings, wherein: 

20 Fig- 1 is a reaction scheme for the enolether 

synthesis described in Example 1 below. Intermediate and 
final products corresponding to various values of integer 
"n" in the respective structural formulae are identified 
by numbers indicated within brackets after the n-values 

25 listed under the respective formulae. 

Fig. 2 is a reaction scheme for the preparation of 
3 ■ -acetal-modif ied thymidine described in Example 2 below. 
Prepared final compounds corresponding to various values 
of integer N n n in the respective structural formula are 

3 0 identified by numbers indicated within brackets after the 
n-values listed under the formula. 

Fig. 3 is a graph showing thymidine 3 ' -acetal 
hydrolysis at pH 4 as log % of remaining acetal versus 
time in minutes. 

35 Fig. 4 is a reaction scheme for the one-pot synthesis 

of 3 • -acetal protected deoxynucleotide triphosphates 
described in Example 4 below. Intermediate and final 
products corresponding to various values of integer "n" in 
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the respective structural formulae are identified by 
numbers indicated within brackets after the n-values 
listed under the respective formulae. 

EXAMPLE 1 

5 Synthesis of enolethers for derivatization of nucleotide 

triphosphates (Fig. 1) 

53 ten 1. Esterif ication of ketoacids 

The appropriate ketoacid (1 eq) was added at once to 
a large excess of dry methanol (2 0 eq) , previously treated 

10 with thionyl chloride (0.1 eq) . The homogenous mixture was 
re fluxed overnight, evaporated under reduced pressure, and 
partitioned between saturated sodium hydrogen carbonate 
and dichlorome thane. The combined organic extracts were 
dried with magnesium sulphate, evaporated and the residue 

15 was distilled under reduced pressure to give the 

appropriate methyl ester in a high yield (80% to 95%) . 
The following ketoacids were used: 

1) 4-oxy pentanoic acid (levulinic acid) (Aldrich) 

2) 5-oxy hexanoic acid has not been isolated. Instead a 
20 commercial ethyl ester-derivative of this acid (Merck) was 

transesterif ied in a reaction analogous to the above 
general procedure . 

3) 6-oxy heptanoic acid was obtained from 2 -methyl 
cyclohexanol (Merck) , according to a published procedure 

25 (Org. Synth. 21 i 3-5, (1951)). 

4) 7-oxy octanoic acid was obtained from 2 -acetyl 
cyclohexanone as described (J. Am. Chem. Soc . 70 : 4023-4026 
(1948) ) . 

Steo 2. Synthesis of the dimethoxvacetal derivatives 

3 0 (compounds 1-4 in Fia . 1) 

Benzenesulf onic acid (0.01 eq) was added to each of 
above methyl esters (1 eq) dissolved in methanol (2 eq) 
and trimethylorthof ormate (3 eq) . The dark brown mixture 
was refluxed for 3 h, neutralized by addition of dry 

3 5 triethylamine (0.1 eq)^ and evaporated. The residue was 
distilled under lowered pressure to yield a pure acetal . 
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Compound 1. The physical and NMR data for this acetal 
correspond w 11 to the data reported previously. 

Compound 2. Yield 85%, bp. 120° (15 ram Hg) *H NMR (CDC1 3 ): 
5 1.28 (s, 3 H) , 1.60-1.69 (m, 4 H) , 2.34 (t, 2 H) , 3.17 (s. 
6 H) . 3 .67 (s, 3 H) . 

Compound 3. Yield 92%. bp. 130-133° (15 mm Hg) NMR 
(CDCI3): 1.25 (s, 3 H), 1.29-1.35 (m, 2 H) , 1.60-1.70 (m, 
10 4 H), 2.33 (t, 2 H), 3.17 (s, 6 H) , 3.67 (s, 3 H) . 

Compound 4. Yield 87%, bp. 147-149° (15 mm Hg) ^ NMR 
(CDCI3): 1.25 <s, 3 H), 1.27-1.38 (m, 4 H) , 1.57-1.67 (m. 
4 H), 2.31 (t, 2 H) , 3.16 (s, 6 H) . 3.66 (s, 3 H) . 

15 

gi-go 1. synthesis of e nnl^f.hers (compounds 5-8 in Fiq , 1 ) ■ 

A mixture of the appropriate acetal (1 eq) and 
trimethylortoformate (0.5 eq) was placed in a distillation 
flask, equipped with a 20 cm Vigreux column. 
20 Benzenesulfonic acid (0.02 eq) was added and the mixture 
was refluxed. The heating was regulated so that the 
liberated methanol evaporated slowly. After 4 h the 
heating was increased and the dark mixture was 
fractionated under reduced pressure without previous 
25 neutralization of the acidic catalyst. The yields of 

isolated colorless enolethers were in all cases very high 
(85-95%) but GC and NMR analyses showed the presence of 
starting acetal in proportions from 10 to 3 0 %. Since 
these impurities were not expected to influence the 
30 derivatization of nucleotides no attempts were made to 
purify them further. The complete assignment of NMR 
signals was difficult because of the presence of starting 
material and the fact that these asymmetric enolethers 
exist as several isomeric forms. Nevertheless, in all 
35 spectra a vinylic signal of CH 2 from one isomer at around 
4.4 to 4.5 ppm and a vinylic signal of CH from the other 
isomer at 3.85 ppm could be easily observed. 
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EXAMPLE 2 

Synthesis of 3' acetal-modif led thymidine (Fig. 2) 

The 5 1 -protected thymidine, 5 ' -FMOC T (FMOC-f luorenyl 
methoxycarbonyl) , (116 mg, 0.25 mmol) was dried by 
5 coevaporation with dry acetonitrile (10 ml) and dissolved 
in dry dioxane (5 ml) . To this magnetically stirred 
solution an appropriate enolether (0.5 ml, =10 eq) 
prepared in Example 1 was added, followed by 
trif luoroacetic acid (20 ml, 1 eq) . After 45 min, the TLC 

10 analysis (Silicagel 60 F254, 10% methanol in chloroform) 
showed complete consumption of the starting material, and 
dry triethylamine (2 ml) was introduced in order to remove 
the base-labile FMOC-group. After this process was 
completed (60 min) , the whole mixture was evaporated under 

15 reduced pressure. The nucleoside was separated from the 

excess of enolether by precipitation from petroleum ether, 
and the precipitate, dissolved in 5 ml of dry methanol, 
was treated with 1, 3-diaminopropane (2 ml, 100 eq) at 60° 
for 180 min in order to effect the aminolysis of the ester 

20 function on the acetal moiety. Finally, the reaction 

mixture was evaporated under low pressure (oil pump) and 
the crude material was flash chromatographed on a silica 
gel column, equilibrated in ethanol and using a step 
gradient of cone, ammonium hydroxide (0-10%) in ethanol. 

25 The material, homogeneous by TLC (developed in 1-butanol - 
ammonium hydroxide 8:2), was combined, evaporated, and 
coevaporated with toluene, to give the NMR pure product 
(compounds 9-12 in Fig. 2) in high yield (72 to 85%). The 
pure material was stored as a stock solution in methanol 

30 after addition of three drops of ammonia. 



Acidic hydrolysis (pH 4) of thymidine, substituted at the 
3 '-position with different acetyl groups 

A reference acetate buffer pH 4.0, prepared by mixing 
35 of solutions of sodium acetate (0.20 M, 18.0 ml) with 

acetic acid (0.20 M, 82.0 ml) was used in all hydrolysis 
studies. To this buffer (10.0 ml) an ethanolic solution 
of the appropriate thymidine 3 * -acetal derivative prepared 
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in Example 2 (compounds 9-12 in Fig. 2) (100 ml) was added 
with gentle stirring. At different time points a sample of 
0.5 ml was withdrawn and placed in a tube containing 15 ml 
of cone, ammonia to raise the pH to 9 . Samples were 
5 analyzed by FPLC using an anion exchange column (Mono Q - 
Pharmacia Biotech AB) and a gradient system of 
tetraethylammonium bicarbonate pH 8.2 for elution (0.05 to 
0.75 M) . The peak areas of starting material and its 
hydrolysis product thymidine were integrated, and plotted 
10 as shown in Fig. 3. 

EXAMPLE 4 

One pot -synthesis of deoxynucleotide triphosphates , 
protected at their 3 • -OH position by a functionalized 

acetal group (Fig. 4) 

15 a commercial deoxynucleotide triphosphate (pppdT, 

pppdC, pppdG, or pppdA) was chromatographed on a 
preparative Mono Q column using a gradient system of 
tetraethylammonium bicarbonate pH 8.2 (0.05 to 1.3 M) to 
obtain the pure triphosphate in the form of a 
20 triethylammonium salt. This material was evaporated on a 
Rotavapor, coevaporated with dry acetonitrile (3x2 ml), 
and dissolved in molecular sieve-dried trimethylphosphate 
(0.5 ml). The appropriate enolether (compounds 5-8 in Fig. 
1) (0.2 ml) and trif luoroacetic acid (10 eq as a 10% 
25 solution in dry dioxane) were added. The homogeneous 

mixture was incubated at 20°C for 60 min, neutralized by 
the addition of triethylamine (100 ml), and precipitated 
from a 1:1 mixture of petroleum ether and diethyl ether. 
The oily precipitate was dissolved in methanol (2 ml) and 
30 1.3-diaminopropane (0.5 ml) or another diamine (1,4- 

diaminobutane, 1 . 6-diaminohexane) (0.5 ml) was added. The 
ester function was subjected to aminolysis overnight at 
60°C. The mixture was again precipitated from 1:1 
petroleum ether and diethyl ether, washed with diethyl 
3 5 ether, and dissolved in water. The water solution was 

analyzed and preparatively purified on the described anion 
exchange column. The well resolved products (compounds 
13-28 in Fig. 4) always appear prior to the original 
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35 





deoxynucleotide triphosphate, having a retention time 
comparable to that of the appropriate deoxynucleotide 
diphosphate {as was found from separate coinjection 
experiments). An aliquot of the isolated 3'-acetal- 
modified derivative was evaporated, treated with 80% 
acetic acid for 2 min, and, after evaporation of acid, 
reinjected to the same chromatographic system. In all 
cases all starting material was reacted and a single 
product with higher retention time was formed that 
corresponded to the original deoxynucleotide triphosphate. 
Thymidine triphosphate reacts to form, besides the desired 
3' -modified derivative, also another product with even 
shorter retention time but that also hydrolyses to the 
starting triphosphate during acid treatment. It is assumed 
15 that this product is the bis-3'-0, 4-0-acetal-derivative 
of thymidine triphosphate. This type of side products was 
not present in reactions that used other nucleotide 
triphosphates. It should also be stressed that very little 
products of depurination were formed in reactions in which 
20 pppdG and pppdA were used. This can be explained by the 
mild acid applied as a catalyst, the large excess of 
enolether used in the reaction, and the fact that the 
bases existed in an unprotected (more acid resistant) 
form . 

25 EXAMPLE 5 

Coupling of a fluorophore to the 3 • -f unctionalised 
deoxynucleotide triphosphate (Pig. 4) 

This derivatization can be performed with a variety 
of reactive fluorophores that may differ in their 
reactivity and in their optimal reaction conditions. Here 
is described the procedure for labelling of 3 ' - 
functionalized deoxynucleotide triphosphate with 
fluorescein isothiocyanate . 

The appropriate nucleotide triphosphate, possessing 
an amino group attached to the 3 ' - position of the sugar 
residue via an acetal function (compounds 13-28 in Fig. 
4), was evaporated and dissolved in 0.1 M carbonate buffer 
pH 10 (0.5 ml). Fluorescein isothiocyanate (single isomer) 
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(10 eq) f dissolved in dimethylf ormamide (0.25 ml) , was 
added and the mixture was incubated overnight at 20°C. The 
. reaction mixture was applied on a prototype FPLC Superdex® 
gel filtration column (Pharmacia Biotech AB) (equivalent 
5 to Sephadex® G-10; Pharmacia Biotech AB) , equilibrated and 
run in TEA HCO3 buffer (0.1 M) . The fluorescent material 
appearing in the void was collected (compounds 29-44 in 
Fig. 4). As expected, the original deoxynucleotide 
triphosphate was formed upon action of acid on the 
10 f luorescein- labelled nucleotide. 

EXAMPLE 6 

Stepwise enzymatical incorporation of 3 9 -modified 

triphosphates 

Four oligonucleotides were prepared, having sequences 
15 corresponding to the M13 sequencing primer, and designed 
to incorporate to their 3 1 -ends T, G, A or C base 
respectively : 

(T) 5 ' CGACGTTGTAAAACGACGGCCAG (23 mer) 

(G) 5 ' CGACGTTGTAAAACGACGGCCA (22 mer) 

20 (A) 5 ' CGACGTTGTAAAACGACGGCC (21 mer) 

(C) 5 ' CGACGTTGTAAAACGACGGC (20 mer) 

Each of them was radioactively labelled with 3 2p 
using T4 polynucleotidekinase . 

In order to investigate if the modified nucleotide 

25 triphosphates are accepted effectively and specifically by 
the T7 polymerase, each of the labelled primers was 
subjected to the single step extension conditions. These 
reaction mixtures (20 ml) were composed of the respective 
primer (0.1 pmol) , M13 template (1 pmol) , and 3 1 -modified 

3 0 thymidine triphosphate (compound 16 in Fig. 4) in a buffer 
system pH 7.5 containing Tris/HCl (40 mM) , MnCl2 (4 mM) , 
dithiothreitol (DTT) (11 mM) , sodium isocitrate (29 mM) 
and sodium chloride (23 mM) . The mixtures were denatured 
at 70°C, cooled, and incubated at 42°C after the addition 

35 of T7 DNA polymerase (10 U) for 3 0 min. All reactions were 
stopped by addition of f ormamide (20 ml) , denatured at 
70°C, cooled on ice, and separated on a 6% denaturing 
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sequencing gel. The developed autoradiogram showed that 
one-base extension had taken place solely in the reaction 
containing 23-mer-T (designed to prime the M13 template 
and incorporate thymidylic acid at its 3' -end). None of 
5 the other reactions containing primers designed for 

incorporation of dG, dA and dC, respectively, showed any 
sign of primer extension. 

EXAMPLE 7 

Sequencing reaction with modified chain terminator 

10 All sequencing reactions were performed according to 

the standard ABI protocol (Taq Dye Primer Cycle Sequencing 
Kit - protocol 901482). 

The sequencing reactions consisted of the following 
components: M13 mp 18 ss-DNA template (1 pmol) , 5'- 

15 fluorescein labelled universal primer (1 pmol, Pharmacia 
Biotech AB) , Ampli Taq DNA polymerase (4U) and T- 
termination mixture in cycle sequencing buffer (Tris-HCl 
pH 8.9, 80 mM, ammonium sulfate 20 mM, magnesium chloride 
5 mM) . Instead of the standard ddT triphosphate, compound 

20 16 in Fig. 4, in different proportions with TTP, was used 
as a chain terminator. After the cycling was completed, an 
equal volume of formamide was added and the mixture was 
heated for 2 min at 90°C, cooled on ice and loaded onto 6% 
acrylamide urea gel. Separation and analysis of products 

25 were performed on an A.L.F. DNA-sequencer (Pharmacia 

Biotech AB) . 

As a result, a pattern of fluorescent-labelled 
sequences was obtained. This electrophoretic picture was 
specific for the applied terminator. 
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1. A compound of the general structure I 



O O X 
HO-^-O-P-O-P-O-^O B 
OH OH OH \ 7 



R t — Z-C-L^CA) - (1*2 ) 1" (Q>m- < F > n 



15 or a salt thereof, wherein 
B is a nucleobase, 

X and Z independently are oxygen or sulphur, 
Y is hydrogen or hydroxy, which optionally may be 
protected, 

20 R X is hydrocarbyl, which optionally is substituted 

with a functional group, 

R 2 is hydrogen or hydrocarbyl, which optionally is 
substituted with a functional group, 

A is an electron withdrawing or electron donating 
25 group capable of moderating the acetal stability of the 
compound I , 

Lx and L-2 are hydrocarbon linkers, which may be the 
same or different, L>2 , when present, being either (i) 
connected to L;l via the group A, or (ii) directly 
3 0 connected to L i# the group A then being connected to one 
of linkers Li and L 2 / 
F is a dye label, 

Q is a coupling group for F, and 

1, m and m independently are 0 or 1, with the proviso 
35 that 1 is 1 when m is 1, and 1 is 1 and m is 1 when n is 
1. 
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2. The compound according to claim 1, wherein and R2 
are independently selected from primary, secondary and 
tertiary allcyl, optionally substituted with one or more 
functional groups. 

5 

3. The compound according to claim 2, wherein Ri and R2 
are selected from methyl and ethyl. 

4. The compound according to claim 1, 2 or 3 , wherein A 
10 is a part of a chain L1-A-L2 and is selected from amido, 

sulfoxy, sulfone, carbalkoxy, ether, thioether and amino. 

5. The compound according to claim 1, 2 or 3 , wherein A 
is a substituent to a chain L1-L2 and is selected from 

15 cyano, nitro and halogen. 

6. The compound according to any one of claims 1 to 5, 
wherein Li is a straight aliphatic hydrocarbon chain, 
preferably having up to 10 carbon atoms. 

20 

7. The compound according to any one of claims 1 to 6, 
wherein L2 is a straight aliphatic hydrocarbon chain, 
preferably having up to 10 carbon atoms. 

25 8. The compound according to any one of claims 1 to 7 , 
wherein Y is alkoxy. 

9. Use of compounds according to any one of claims 1 to 
8 in nucleic acid synthesis reactions, especially for 

30 preparing oligo- or polynucleotides. 

10. Use of compounds according to any one of claims 1 to 
8 in minisequencing. 

3 5 11. A method for determining the sequence of a nucleic 
acid, comprising providing a single-stranded template 
comprising the nucleic acid to be determined, and at least 
partially synthesizing a complementary nucleic acid 
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20 



molecule in a stepwis serial manner by the addition of 
nucleotides in which the identity of each nucleotide 
incorporated into the complementary nucleic acid molecule 
is determined subsequent to its incorporation, wherein 
said nucleotides are compounds of Formula I as defined in 
any one of claims 1 to 8, and wherein the 3 ■ -blocking 
group is removed from the nucleotide after its 
incorporation to permit further extension of the nucleic 
acid molecule. 

12. The method according to claim 11, wherein the 3'- 
blocking group is removed by acid hydrolysis. 



13. The method according to claim 11 or 12. wherein F in 
15 Formula I is a fluorescent label. 

14. The method according to any one of claims 11 to 13, 
wherein said single-stranded template is bound to a solid 
support . 



15. A process of preparing a compound of Formula la: 



o o x 
HO -P-O-P-O-P-O-t/O O 

25 OH OH OH \ / 

O BRi 

(CH2) p 

>^-(CH 2 ) q -(Q)m-( p >n 
O [j 

30 

ar a salt thereof, wherein B, X. Z, R lf R 2 , Q. F. m and n 
are as defined in claim 1, and p and q independently are 
integers from 1 to 10, by reacting a compound of Formula 
35 II: 
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O Q 



HO-P-O-P-O-P— O — I JD^B II 



OH OH OH 




OH 



or a salt thereof, wherein B and X are as defined above, 
with a compound of Formula III : 



10 

R 2 -C=CH- (CH 2 )p_l-COOR 3 m 



wherein Ri, R2/ Z and p are as defined above, and R3 is 
15 hydrocarbyl, to produce a compound of Formula IV: 




B 



OO 
11 11 ii 
HO-P-O-P-O-P-0 

OH OH OH 

20 °V-ZRl IV 

R 2 \ 

(CH 2 ) p 

COOR3 

or a salt thereof, wherein B, X, Z, R]_, R2 , R3 and P are 
25 as defined above, optionally reacting the latter with a 

diamine H2N- (CH2 ) q -NH2 , wherein q is as defined above, to 
produce a compound of Formula V: 



30 



O O x 
II II I' 
HO -P-O-P-O-P— O 

OH OH OH 




V 



.2R X 



35 



( C »2)p 

<C. N /<CH2) q -NH2 
U H 



/ 
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wherein B, X, Z, R 1# R 2 , P and q are as defined abov , and 
optionally coupling a dye label F to the t rminal amino 
group . 



5 



/ 



1/5 




PCT/SE96/00096 



en 

CO 



o 
o 

o 



o 

CO 

X 

o 



o 



o 

CO 

X 

o 



o 
o 
o 
o 
I 

c 
X 

o 
o 

CO 



2 

LL. 



t- cm n v 



CM W ^ 
II II II 

c c c 



in 
ii 




O 
O 
O 
O 

i 



CM 

X 

CO O 

5 T 

o-o 
ii 

o 

CM 



If) (O N 00 



CM CO ^ 
II II II 

c c c 



m 
ii 



SUBSTITUTE SHEET (RULE 26) 




SUBSTITUTE SHEET (RULE 26) 



WO 96/23807 




PCT/SE96/0Q096 



3/5 



CO 

W p) m 

ii ii n ii (D 
c c c c EE 

□ ♦no 




SUBSTITUTE SHEET (RULE 26) 




PCT/SE96/00096 



4/5 



X 

o 
o 
e o 

O 

o'b - 

CO 



< 

UJ 



I 

o 



< 

111 




(25) 


Jo 


(27) 


(28) 


to 


in 


m 


m 


II 


II 


ii 


ii 


c 


c 








(22) 


(23) 


(24) 










II 


ii 


II 


11 


c 










CO 


ST 


(20) 


CO 


CO 


CO 


CO 


II 


11 


ii 


ii 


c 


c 




c 






In 


to 


CM 


CNi 


CM 


CM 


II 


II 


11 


II 


c 


c 




c 


< 


o 


O 


t- 


11 


II 


II 


II 


CO 


CD 


m 


CD 



SUBSTITUTE SHEET (RULE 26) 



WO 96/23807 




o 



o=Q--b 

O X 

1 ~ < 

0=Q--0 LU 
I H 
O 

o=o--b 

I 

o 




PCT/SE96/00096 



■ 

o 

o 

d 
u. 





(42) 


(43) 


(44) 


in 


If) 


m 


in 


ii 


II 


II 


ii 


c 




c 


c 


(37) 


(38) 


(39) 


(40) 










II 


II 


II 


Kl 


c 




c 




CO 

co. 


(34) 


(35) 


(36) 


CO 


CO 


CO 


CO 


II 


II 


II 


II 


c 




c 




ST 


(30) 


T" 

CO 


(32) 


CM 


CM 


CM 


CM 


II 


II 


ii 


11 


c 


c 


c 


C 


< 


o 


O 


1- 


II 


II 


ii 


II 


CD 


CD 


ffi 


m 



SUBSTITUTE SHEET (RULE 26) 




asm, ^B«aa««amaMKaMK 





INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/SE 96/00096 



A. CLASSIFICATION OF SUBJECT MATTER 



IPC6: C07H 19/10. C07H 19/20, C12Q 1/68 

According io International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC6: C07H, C12Q 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 

SE,DK,FI,N0 classes as above 



Electronic data base consulted during the international search (name of data base and. where practicable, search terms used) 



CA. WPI. IFIPAT 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



P,A 



["^~| Further documents are listed in the continuation of Box C. See patent family annex. 



Citation of document, with indication, where appropriate, of the relevant passages 



STN International, File CA, Chemical abstracts, 
Volume 124, Canard, B.: "Catalytic editing 
properties of DNA polymerases", abstract no. 
49135, & Proc. Natl. Acad. Sci . U. S. A. (1995), 
92(24), 10859-63 



WO 9423064 Al (INSTITUT PASTEUR), 13 October 1994 
(13.10.94) 



EP 0399330 Al (B0EHRINGER MANNHEIM GMBH), 
28 November 1990 (28.11.90) 



Relevant to claim No. 



1-15 



1-15 



1-15 



* Special categories of cited documents 

*A* document defining the general state of the art which is not considered 

to be of particular relevance 
9 E" ertier document but published on or after the international filing date 
*L* document which may throw doubts on priority claim(s) or which is 

cited to establish the publication date of another citation or other 

special reason (as specified) 
"O* document referring to an oral disclosure, use, exhibition or other 

means 

*P" document published prior to the international filing date but later man 
the priority date claimed 



T* later document published after the international filing date or priority 
date and not in conflict with the application but cited to understand 
the principle or theory underlying the invention 

'X* document of particular relevance: me claimed invention cannot be 
considered novel or cannot be considered to involve an inventive 
step when the document is taken alone 

*Y* document of particular relevance the claimed invention cannot be 
considered to involve an inventive step when the document is 
combined with one or more other such documents, such combination 
being obvious to a person skilled in the art 
document member of me same patent family 



Date of the actual completion of the international search 



17 Anril 1996 



Name and mailing address of the ISA/ 
Swedish Patent Office 
Box 5055. S-102 42 STOCKHOLM 
Facsimile No. + 46 8 666 02 86 



Date of mailing of the international search report 

16.04.96 



Authorized officer 

Eva Johansson 

Teleph ne No. +46 8 782 25 00 



Form PCT/ISA/210 (second sheet) (July 1992) 




INTERNATIONAL SEARCH REPORT 




ial application No. 
PCT/SE 96/00096 



C (Continuation). DOCUMENTS CONSIDERED TO BE RELEVANT 



Category • 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



EP 0628568 A2 (BOEHRINGER MANNHEIM GMBH), 
14 December 1994 (14.12.94) 



EP 0286028 A2 (BOEHRINGER MANNHEIM GMBH), 
12 October 1988 (12.10.88) 



W0 9106678 Al (SRI INTERNATIONAL), 16 May 1991 
(16.05.91) 



US 5302509 A (P.C. CHEESEMAN), 12 April 1994 
(12.04.94) 



1-15 



1-15 



1-15 



1-15 



Form PCT/ISA/210 (continuation of second sheet) (July 1992) 




INTERNATIONA1T3MRCH REPORT 

Information on patent family members 




01/04/96 



International application No. 

PCT/SE 96/00096 



Patent document 
cited in search report 



Publication 
date 



Patent family 
member(t) 



Publication 
date 



W0-A1- 9423064 



13/10/94 



NONE 



EP-A1- 0399330 



28/11/90 



SE-T3- 

AT-T- 

CA-A- 

DE-A- 

DE-D- 

ES-T- 

JP-A- 

JP-B- 

JP-A- 



0399330 
116327 
2017369 
3916871 
59008105 
2068279 
3005495 
7030108 
7233188 



15/01/95 
24/11/90 
29/11/90 
00/00/00 
16/04/95 
11/01/91 
05/04/95 
05/09/95 



EP-A2- 0628568 14/12/94 NONE 



EP-A2- 0286028 



12/10/88 



SE-T3- 


0286028 




AT-T- 


124415 


15/07/95 


AU-B.B- 


597483 


31/05/90 


AU-A- 


1439888 


13/10/88 


CA-A- 


1311201 


08/12/92 


CN-A- 


88102038 


26/10/88 


DE-A- 


3739366 


27/10/88 


DE-D- 


3854060 


00/00/00 


JP-A- 


7048396 


21/02/95 


JP-A- 


63275598 


14/11/88 


ZA-A- 


8802446 


29/09/88 



W0-A1- 9106678 16/05/91 CA-A- 2044616 27/04/91 

EP-A,A- 0450060 09/10/91 



US-A- 5302509 12/04/94 NONE 



Form PCT/1SA/210 (patent family annex) (July 1992) 



